Expected Utility Estimation via Cross-Validation

نویسندگان

  • AKI VEHTARI
  • JOUKO LAMPINEN
چکیده

We discuss practical methods for the assessment, comparison and selection of complex hierarchical Bayesian models. A natural way to assess the goodness of the model is to estimate its future predictive capability by estimating expected utilities. Instead of just making a point estimate, it is important to obtain the distribution of the expected utility estimate in order to describe the associated uncertainty. We synthesize and extend the previous work in several ways. We give a unified presentation from the Bayesian viewpoint emphasizing the assumptions made and propose practical methods to obtain the distributions of the expected utility estimates. We discuss the properties of two practical methods, the importance sampling leave-one-out and the k-fold cross-validation. We propose a quick and generic approach based on the Bayesian bootstrap for obtaining samples from the distributions of the expected utility estimates. These distributions can also be used for model comparison, for example, by computing the probability of one model having a better expected utility than some other model. We discuss how the crossvalidation approach differs from other predictive density approaches, and the relationship of cross-validation to information criteria approaches, which can also be used to estimate the expected utilities. We illustrate the discussion with one toy and two real world examples.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Model Selection via Predictive Explanatory Power

We consider model selection as a decision problem from a predictive perspective. The optimal Bayesian way of handling model uncertainty is to integrate over model space. Model selection can then be seen as point estimation in the model space. We propose a model selection method based on Kullback-Leibler divergence from the predictive distribution of the full model to the predictive distribution...

متن کامل

Bayesian Model Assessment and Comparison Using Cross-Validation Predictive Densities

In this work, we discuss practical methods for the assessment, comparison, and selection of complex hierarchical Bayesian models. A natural way to assess the goodness of the model is to estimate its future predictive capability by estimating expected utilities. Instead of just making a point estimate, it is important to obtain the distribution of the expected utility estimate because it describ...

متن کامل

Utility Estimation of Health Status of Cancer Patients by Mapping for Cost-Utility Analysis

Background: It is important to obtain accurate information about the preferences of people for measuring quality-adjusted life years (QALYs), because it is necessary for cost-utility analysis. In this regard, mapping is a method to access this information. Therefore, the purpose of this study was to map Functional Assessment of Cancer Therapy – General (FACT-G) onto Short Form Six Dimension (SF...

متن کامل

Bayesian Input Variable Selection Using Posterior Probabilities and Expected Utilities

We consider the input variable selection in complex Bayesian hierarchical models. Our goal is to find a model with the smallest number of input variables having statistically or practically at least the same expected utility as the full model with all the available inputs. A good estimate for the expected utility can be computed using cross-validation predictive densities. In the case of input ...

متن کامل

Strictly Proper Scoring Rules, Prediction, and Estimation

Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the forecast and on the event or value that materializes. A scoring rule is strictly proper if the forecaster maximizes the expected score for an observation drawn from the distribution F if she issues the probabilistic forecast F , rather than any G 6= F . In prediction problems, strictly prope...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002